A method for identifying a disease-influencing gene, the 
method comprising the steps of: 

a) selecting individuals having a risk factor for a disease; 

b) creating queries regarding the individuals' behaviors and 
environments; 

c) storing the queries on a server; 

d) providing each of the individuals with a remotely 
programmable apparatus having a user interface for 
communicating the queries and for receiving responses, 
and having communication means for communicating 
with the server through a communication network; 

e) transmitting the queries from the server to each of the 
remotely programmable apparatuses; 

f) transmitting the responses of the individuals to the 
queries from the remotely programmable apparatuses to 
the server; 

g) creating a database of the individuals' behaviors and 
environments; 

h) using data mining techniques to distinguish a group of 
individuals having similar behavioral and environmental 
profiles; 

i) categorizing the group of individuals into at least two 
categories according to the individuals' disease 
progression; 

j) determining [the genotypes of] at least one physiologic 
characteristic relating to a genotype for each of the at 
least two categories of individuals; 

k) using data mining techniques to find a gene difference 
between the at least two categories of individuals based 
upon the at least one physiologic characteristic relating 
to the genotype for each of the categories of 
individuals; and 

l) identifying the disease-influencing gene. 
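
For illustration only, and not as part of the claim, steps h) through k) can be sketched as follows. The record layout, the exact-match grouping, and the allele labels are all hypothetical. The claim does not name a particular data mining technique, so simple set operations stand in for one here.

```python
from collections import defaultdict

# Hypothetical records: (individual id, behavioral/environmental profile,
# disease progression, genotype as a set of marker alleles).
records = [
    ("p1", ("smoker", "urban"), "fast", {"G1:A", "G2:T"}),
    ("p2", ("smoker", "urban"), "fast", {"G1:A", "G2:C"}),
    ("p3", ("smoker", "urban"), "slow", {"G1:G", "G2:T"}),
    ("p4", ("smoker", "urban"), "slow", {"G1:G", "G2:C"}),
    ("p5", ("non-smoker", "rural"), "slow", {"G1:A", "G2:T"}),
]


def group_by_profile(recs):
    """Step h): group individuals whose profiles match exactly (an assumption)."""
    groups = defaultdict(list)
    for rec in recs:
        groups[rec[1]].append(rec)
    return groups


def categorize_by_progression(group):
    """Step i): split one profile group by disease progression."""
    categories = defaultdict(list)
    for rec in group:
        categories[rec[2]].append(rec)
    return categories


def allele_differences(cat_a, cat_b):
    """Step k): alleles shared by all of one category and absent from the other."""
    shared_a = set.intersection(*(r[3] for r in cat_a))
    shared_b = set.intersection(*(r[3] for r in cat_b))
    any_a = set.union(*(r[3] for r in cat_a))
    any_b = set.union(*(r[3] for r in cat_b))
    return (shared_a - any_b) | (shared_b - any_a)


group = group_by_profile(records)[("smoker", "urban")]
categories = categorize_by_progression(group)
candidates = allele_differences(categories["fast"], categories["slow"])
print(sorted(candidates))  # ['G1:A', 'G1:G']
```

In this toy data only marker G1 separates the fast and slow categories, so it would be the candidate passed to step l).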



10. A method for identifying a disease-influencing gene, the 
method comprising the steps of: 

a) selecting individuals having a risk factor for a disease; 

b) creating queries regarding the individuals' behaviors and 
environments; 

c) storing the queries on a server; 

d) providing each of the individuals with a remotely 
programmable apparatus having a user interface for 
communicating the queries and for receiving responses, 
and having communication means for communicating 
with the server through a communication network; 

e) transmitting the queries from the server to each of the 
remotely programmable apparatuses; 

f) transmitting the responses of the individuals to the 
queries from the remotely programmable apparatuses to 
the server; 

g) creating a database of the individuals' behaviors and 
environments; 

h) distinguishing a group of individuals having similar 
disease progressions; 

i) using data mining techniques to categorize the group of 
individuals into at least two categories according to the 
individuals' behavioral and environmental profiles; 

j) determining at least a portion of the genotypes of the 
at least two categories of individuals; 

k) using data mining techniques to find a gene difference 
between the at least two categories of individuals based 
at least in part upon a gene difference between the at 
least a portion of the respective genotypes; and 

l) identifying the disease-influencing gene. 

19. A method for identifying a disease-influencing substance, the 
method comprising the steps of: 

a) selecting individuals having a risk factor for a disease; 

b) creating queries regarding the individuals' behaviors and 
environments; 

c) storing the queries on a server; 

d) providing each of the individuals with a remotely 
programmable apparatus having a user interface for 
communicating the queries and for receiving responses, 
and having communication means for communicating 
with the server through a communication network; 

e) transmitting the queries from the server to each of the 
remotely programmable apparatuses; 

f) transmitting the responses of the individuals to the 
queries from the remotely programmable apparatuses to 
the server; 

g) creating a database of the individuals' behaviors and 
environments; 

h) determining a gene sequence of a genotype for each of 
[the genotypes of] the individuals; 

i) distinguishing a group of the individuals having similar 
gene sequences [genotypes]; 

j) categorizing the group of individuals into at least two 
categories according to their disease progressions; and 

k) using data mining techniques to find a disease- 
influencing substance from the behavioral and 
environmental profiles between the at least two classes 
of individuals. 
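
For illustration only, and not as part of the claim, step k) can be sketched as a comparison of exposure rates between the two progression categories of one genotype group. The field names, the exposure labels, and the 0.5 threshold are hypothetical, and the rate difference is a simple stand-in for whatever data mining technique is actually used.

```python
from collections import Counter

# Hypothetical individuals who share one gene sequence (steps h and i),
# each with a disease progression and self-reported exposures.
same_genotype_group = [
    {"progression": "fast", "exposures": {"solvent X", "coffee"}},
    {"progression": "fast", "exposures": {"solvent X"}},
    {"progression": "slow", "exposures": {"coffee"}},
    {"progression": "slow", "exposures": set()},
]


def exposure_rates(individuals):
    """Fraction of individuals in a category reporting each exposure."""
    if not individuals:
        return {}
    counts = Counter(e for person in individuals for e in person["exposures"])
    return {e: n / len(individuals) for e, n in counts.items()}


def candidate_substances(group, threshold=0.5):
    """Exposures whose rate differs between categories by at least threshold."""
    fast = [p for p in group if p["progression"] == "fast"]
    slow = [p for p in group if p["progression"] == "slow"]
    fast_rates, slow_rates = exposure_rates(fast), exposure_rates(slow)
    result = {}
    for exposure in set(fast_rates) | set(slow_rates):
        diff = fast_rates.get(exposure, 0.0) - slow_rates.get(exposure, 0.0)
        if abs(diff) >= threshold:
            result[exposure] = diff
    return result


print(candidate_substances(same_genotype_group))  # {'solvent X': 1.0}
```

Because genotype is held constant within the group, an exposure that tracks disease progression is a candidate substance. A real analysis would also need a statistical test and far more individuals.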



24. A database and data processing system for finding a disease- 
influencing gene among individuals having a risk factor for a 
disease, the database and data processing system comprising: 

a) a server for storing queries regarding the individuals' 
behavior and environment and for storing the individuals' 
responses to the queries; 

b) at least one remotely programmable apparatus in 
communication with the server, wherein the remotely 
programmable apparatus comprises: 

i) a user interface for communicating the queries to 
the individuals and for receiving the responses; 
and 

ii) communication means for receiving the queries 
from the server and for transmitting the responses 
to the server; 

c) genotyping means in communication with the server for 
obtaining at least a portion of the genotype of the 
individual sufficient to group individuals having a similar 
genotype; and 

d) data mining means in communication with the server, 
wherein the data mining means includes: 

i) means for analyzing the responses in order to 
group the individuals having a similar behavioral 
and environmental profile, a similar disease 
progression, and a similar genotype; 

ii) means for analyzing the responses in order to 
group the individuals having a similar disease 
progression; 

iii) means for analyzing the responses in order to 
group the individuals having a similar genotype; 
and 

iv) means for identifying the disease-influencing gene. 
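
For illustration only, and not as part of the claim, elements a) and b) can be sketched as two small classes. All class, method, and field names are hypothetical, and a direct method call stands in for the communication network.

```python
from dataclasses import dataclass, field


@dataclass
class Server:
    """Element a): stores the queries and the individuals' responses."""
    queries: dict = field(default_factory=dict)    # query id -> query text
    responses: dict = field(default_factory=dict)  # (individual, query id) -> answer

    def get_queries(self):
        return dict(self.queries)

    def store_response(self, individual_id, query_id, answer):
        self.responses[(individual_id, query_id)] = answer


@dataclass
class Apparatus:
    """Element b): presents the queries and sends the responses back."""
    individual_id: str
    server: Server

    def run_session(self, answer_fn):
        # answer_fn stands in for the user interface of element b) i).
        for query_id, text in self.server.get_queries().items():
            self.server.store_response(self.individual_id, query_id, answer_fn(text))


server = Server(queries={"q1": "Do you smoke?", "q2": "Do you live near a highway?"})
Apparatus("p1", server).run_session(lambda text: "yes" if "smoke" in text else "no")
print(server.responses)
```

The stored responses are what the data mining means of element d) would then analyze.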



27. A database and data processing system for use in finding a 
disease-influencing substance among individuals having a risk 
factor for a disease, the database and data processing system 
comprising: 






a) a server for storing queries regarding the individuals' 
behavior and environment and for storing the individuals' 
responses to the queries; 

b) at least one remotely programmable apparatus in 
communication with the server, wherein the remotely 
programmable apparatus comprises: 

i) a user interface for communicating the queries to 
the individuals and for receiving the responses; 
and 

ii) communication means for receiving the queries 
from the server and for transmitting the responses 
to the server; 

c) genotyping means in communication with the server for 
obtaining at least a portion of the genotype of the 
individual; and 

d) data mining means in communication with the server, 
wherein the data mining means includes: 

i) means for analyzing the responses in order to 
group the individuals having a similar behavioral 
and environmental profile, a similar disease 
progression, and a similar at least a portion of 
the genotype; 


